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Abstract 

We consider the problem of finding edges of a hidden weighted graph using a certain type of queries. 
Let G be a weighted graph with n vertices. In the most general setting, the n vertices are known and no 
other information about G is given. The problem is finding all edges of G and their weights using additive 
queries, where, for an additive query, one chooses a set of vertices and asks the sum of the weights of 
edges with both ends in the set. This model has been extensively used in bioinformatics including genom 
| sequencing. Extending recent results of Bshouty and Mazzawi [TT] , and Choi and Kim [T7] , we present a 

polynomial time randomized algorithm to find the hidden weighted graph G when the number of edges 
in G is known to be at most m > 2 and the weight w(e) of each edge e satisfies a < \w(e)\ < (3 for 
fixed constants a,/3 > 0. The query complexity of the algorithm is Q( " Q ^" ), which is optimal up to a 
constant factor. 

The algorithm heavily relies on a well-known combinatorial search problem, which may be of inde- 
pendent interest. Suppose that there are n identical looking coins and some of them are counterfeit. 
The weights of all authentic coins are the same and known a priori. The weights of counterfeit coins 
vary but different from the weight of an authentic coin. Without loss of generality, we may assume the 
weight of authentic coins is 0. The problem is to find all counterfeit coins by weighing sets of coins on 
a spring scale. We introduce a polynomial time randomized algorithm to find all counterfeit coins when 
the number of them is know to be at most m > 2 and the weight w(c) of each counterfeit coin c satisfies 
a < \w(c)\ < (3 for fixed constants a,/3. The query complexity of the algorithm is O C 7 ^ 1 "^" ), which is 
optimal up to a constant factor. The algorithm uses, in part, random walks. 

Keywords - graph finding, combinatorial search, coin weighing, additive query, random walk 
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• ' 1.1 Graph Finding Problem 

We consider the problem of finding edges of a hidden weighted graph using a certain type of queries. Let 
7— ( . G be a weighted graph with n vertices. In the most general setting, the n vertices are known and no other 
j>! | information about G is given. The problem is finding all edges of G and their weights using queries. Three 
' types of queries have been extensively studied: 

^ \ Detection query: One chooses a set of vertices and asks if there is an edge with both ends in the set. 
This type of queries has applications to genom sequencing and has been studied in [U IU O HI [2U [25] . 

Additive query: One chooses a set of vertices and asks the sum of weights of edges with both ends in 
the set. This model has been extensively used in bioinformatics including genom sequencing, and studied 
in [31 El El El [lOl [III [111 [231 [251 [261 [Ml M- 

Shortest path query: One choose a pair of vertices and asks the length of the shortest path between the 
two vertices. This query arises in the canonical model of the evolutionary tree literature |27l 129 } HP]. 

(Our lists of references are far from being exhaustive.) 

In this paper, we focus on the additive queries. The graph finding problem with additive queries is 
partly motivated by the shotgun sequencing [25] , one of the most popular methods for DNA sequencing. 
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In the shotgun sequencing, one needs to put back separately decoded short fragments of a given genome 
sequence into the same order as in the original sequence. Combined with a biotech method called the 
multiplex PCR [33], the process is reduced to the problem of finding a hidden graph using additive queries. 
The additive queries are also used in the problem of finding the Fourier coefficients of pseudo-Boolean func- 
tions, which play crucial roles in evolutionary computation, artificial intelligence, and population genetics 

[lElCESlCE]. 

In the rest of this paper, we say queries for additive queries and all logarithms are in base 2, unless 
otherwise specified. For unweighted graphs, Grebinski and Kucherov presented a few results. For arbitrary 

2 

graphs on n vertices, they have shown that queries are enough |26j . If the hidden graph is known 

to be a Hamiltonian path or cycle, then O(n) queries are suffice [25| . More generally, if the maximum 
degree of the hidden graph is known to be at most d, then the graph may be found using 0(dn) queries 
|26j . Grebinski [23] has shown that the same bound 0(dn) holds for d-degenerate graphs. 

When the hidden graph has at most m > 2 edges and m is known, some bounds close to the optimal 
bound were shown [3] [39] and Choi and Kim [18] proved a 0( ml °pg n m ) bound that is optimal (up to 
a constant factor). The randomized algorithm presented there uses non-adaptive queries but it is not a 
polynomial time algorithm, where queries are non-adaptive if each query is independent of answers to the 
previous queries. Recently, Mazzawi [36] constructed a polynomial time algorithm with optimal query 
complexity. The algorithm is deterministic and uses adaptive queries. She also extended the algorithm to 
find weighted graphs with positive integer weights. 

For weighted graphs, Choi and Kim [18] proved a non-adaptive O C^o'gm" ) Q uer y bound, provided that 
m is at least a polylog of n and the absolute values of all weights are between n~ a and n b for constants 
a, b > 0. Bshouty and Mazzawi [11] showed the same bound without the extra conditions. However, it 
is unlikely that one may able to develop a polynomial time algorithm from those results. In other words, 
substantially new ideas seem to be needed to design an algorithm that is useful in practical sense. A 
significant result toward this direction has been shown by Bshouty and Mazzawi [9j: For weighted graphs 
with positive real weights, they presented a deterministic polynomial time algorithm that uses an almost 
optimal number of (adaptive) queries, 0( ™ o 1 °^ t ?1 + m log log m) . Note that the extra m log log m term is 
larger than the optimal query bound by a log log n factor when logm = 0,(}ogn). 

To obtain the optimal query complexity 0( ™ o 1 °^"' ), Choi and Kim |17] have recently introduced a 
randomized polynomial time algorithm that finds the hidden weighted graph with positive real weights. 
Another randomized polynomial time algorithm they introduced uses O ^J"^" ) queries to find the hidden 
weighted graph with bounded integer weights. 

In this paper, we present a randomized polynomial time algorithm that works for a quite general class 
of weighted graphs. Using the optimal number of queries up to constant factor, the algorithm finds the 
hidden weighted graph provided that the weight w(e) of each edge e in the graph satisfies a < \w(e)\ < j3 
for positive constants a and /3. The theorem we will prove is slightly more general in the sense that a, (3 
are not necessarily constants. 

Theorem 1.1. Let n,m be positive integers with n 2 > m > 2 and let a,j3 > be positive real numbers (not 
necessarily constants) with 2a < (3. Suppose a weighted graph G with n vertices and at most m edges is 
given. If the weight w(e) of each edge in G satisfies a < \w(e)\ < (3, then there is a randomized polynomial 
time algorithm that asks O ( m l ° S [^g ^ ~ ) Q uer "i' es t° find all edges with probability 1 — O(l/m ' 02 ). 

Our proof of the theorem heavily relies on a well-known combinatorial search problem. Suppose there 
are n identical looking coins and some of them are counterfeit. The weights of all authentic coins are the 
same and known a priori. The weights of counterfeit coins vary but different from the weight of an authentic 
coin. The problem is to find all counterfeit coins by weighing sets of coins on a spring scale. Note that 
weighing sets of coins on a spring scale may be regarded as additive queries. This problem is also equivalent 
to the graph finding problem when the graphs are restricted to stars K\ m with known center. The coin 
weighing problem has been extensively studied. We survey its colorful history and add one more algorithm 
finding all counterfeit coins when the weights of each counterfeit coin satisfies properties similar to those 
described in the above theorem. 
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1.2 Coin Weighing Problem 

Suppose there are n identically looking coins, some of them are counterfeit. The weights of all authentic 
coins are the same and known a priori, while the weights of counterfeit coins are unknown but different 
from the weight of an authentic coin. Without loss of generality, it may be assumed that the weights of 
authentic coins are and the weights of counterfeit coins belong to a set of non-zero real numbers. We 
want to find all counterfeit coins by weighing sets of coins on a spring scale, which we call additive queries 
or simply queries. 

After the coin weighing problem was introduced by Fine [22] and Shapiro [H], a number of results 
have been published, mainly focusing on the case that the weights of counterfeit coins are the same \\.2\ 
[T3"l EH EH E21 [331 EH H2]: Summarizing some of them briefly, Erdds and Renyi [21J, in 1963, proved that 
\ogn^ n <l uer i es are enough and ^°g^" queries are required. (See [30] for another proof of the lower 
bound.) The upper bound was improved to match the lower bound by Cantor and Mills [13] . and Lindstrom 
[32] . Using the Mobius function, Lindstrom [331134] explicitly constructed a query matrix that asks ^"["pg^" 
queries. The case that the number m of counterfeit coins is also known has been extensively studied too 
[HI EH [2Ql EH [Ml ESI EH H5] . Recently, Bshouty [7] proposed the first polynomial time algorithm that uses 

(l+o(l))2mlog ^ , ,. -mi i ■ i . 

j^-^ — adaptive queries. Ine query complexity is optimal up to o(l) term. 

Results for the general case, in which the weights of counterfeit coins are not the same, have been 
obtained only recently. As the results were applied to the (weighted) graph finding problem, our summary 
is almost the same as in the previous subsection. When the weights of the counterfeit coins can be any 
(not necessarily positive) real numbers, Choi and Kim [I8j proposed an algorithm with a non-adaptive 
^(log?7?) cL uer y bound, under the mild conditions on m and the weights, i.e., m = fi(polylogra) and the 
absolute values of all weights are between n~ a and n b for constants a, b > 0. Bshouty and Mazzawi [ll] 
showed the same bound without the extra conditions. Though the query complexities of both algorithms 
are optimal, the time complexities of them are far from being polynomial. Concerning polynomial time 
algorithms, Bshouty and Mazzawi [9] presented a deterministic polynomial time algorithm that uses a near 
optimal number of (adaptive) queries, O C^ 1 "^" + to log logm), assuming the weights of all counterfeit coins 
are positive real numbers. They first constructed a search matrix using Fourier representations, and took 
the divide and conquer approach to guess the sums of the weights of coins. The search matrix played key 
roles when the sums of the weights were guessed. The processes for checking and correction follow after 
guessing. 

As mentioned before, the extra m log log m term is larger than the optimal bound by a log log n factor 
when logm = f2(logn). Choi and Kim [T7] presented a polynomial time randomized algorithm to remove 
the m log logm term in the query complexity. Another polynomial time randomized algorithm may be 
applied to achieve the optimal query complexity, when the weights of counterfeit coins are bounded integers 
in absolute values. The key idea is constructing random sets of coins that are useful to control the number 
of checking and correction processes used by Bshouty and Mazzawi [9]. Once the number of checking and 
correction processes is substantially reduced, less queries are needed. 

A randomized algorithm is presented in this paper to achieve the optimal query complexity when the 
weights of counterfeit coins are any real numbers bounded from below and from above in absolute values. 
The theorem we will prove is slightly more general in the sense that some exceptions for the weight condition 
are allowed. 

Theorem 1.2. Let n, m be positive integers with n > m > 2 and let a, /3, e > be positive real numbers (not 
necessarily constants) with 2a < /3,e < 1/2. Suppose n coins are given and there are at most m counterfeit 
coins among them. The weights of authentic coins are while the wights of counterfeit coins vary but they 
are non-zero. If the weights w(c) of all but em counterfeit coins c satisfy a < |w(c)| < /3 and the weights 
w(c) of the em counterfeit coins c satisfies just \w(c)\ < f3, then there is a randomized polynomial time 
algorithm that asks 0( m log jfg^ log - ) queries and finds all butmP' 8 + 2em counterfeit coins, with probability 
1 — 0(l/m 0,8 ). All the remaining counterfeit coins can be found using O((m ' 8 + 2em)logn) additional 
queries, with probability 1 — e~^ m ' \ 
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In the proof of Theorem 1 1.2 1 we use the search matrix Bshouty and Mazzawi [9] developed after constructing 
random sets of coins as in Choi and Kim [T7J. Though the guessing processes are the same as in [17], the 
processes for checking and correction are newly developed using biased random walks. 

One may easily verify if the coins declared to be counterfeit by the algorithm in Theorem 11.21 are 
actually counterfeit by directly weighing, using m additional queries. Running the algorithm 0(/i) times 
with the verification at each time, the error probability may be made arbitrarily small. 

Corollary 1.3. Under the same hypotheses of Theorem I and any integer \i > 1, there is a randomized 
polynomial algorithm that uses O ( ^ m l ° g ifJ^ log " ) queries and finds all but m 8 + 2em counterfeit coins 
with probability 1 — 0(l/m^). All the remaining counterfeit coins can be found using 0((m 0,8 + 2em) log re) 
additional queries, with probability 1 — e~^ m ' . 

After presenting the search matrix and two martingale inequalities in Section [21 we prove Theorem 
11.21 in Section [3l Section [5] is for the proof of Theorem 11.11 The concluding remark will follow. 



2 Preliminaries 

As mentioned in the previous section, Bshouty and Mazzawi [9] used Fourier representation of certain 
functions to find a search matrix, i.e., a 0, 1 matrix that is useful for coin weighing problems. We present 
properties of the matrix in a slightly generalized form. 

Lemma 2.1. Let 7,771 be positive integers. Then, for the smallest integer t satisfying t2 t ~ 1 > 7777, one can 
construct, in polynomial time, 2* x m 0, 1 matrix S and 2* x 2* matrix T with the following property: For 
each j = 1, ...,777, one may find, in polynomial time, a unique positive integer ij < 2* and a non-negative 
integer kj < [i/VI — 1 satisfying 

(TS) i>k = 2^ k -^ a (TS) yl for j + l<k<j + kj, and (TS% k = for k > j + kj + 1, 

where (TS)ij is the ij entry ofTS. 



Setting ajk = ^ TS y , we have the following corollary. 



Corollary 2.2. Let 7,777 are positive integers and t be the smallest integer satisfying t2 > 7777. Then 
one can find, in polynomial time, 2* non-adaptive queries, real numbers djk, and a non-negative integer 
kj < \t/ f y] — 1, j = 1, 777, k = 1, j — 1, satisfying the following property: For disjoint sets A\, ...,A m of 
coins, the 2 queries yield values polynomial time, satisfying 

j — l kj 

w(Aj) = X j - aj k w(A k ) - 2^ 



2kc 

k=l k=l 



j = 1, 777 ; where w(A) is the sum of weights of all coins in A. In particular, ^ 2 ^°|^^ m queries are enough 
to find Xj 's. 

We will need the Azuma-Hoeffding martingale inequality too. The following is from |37j . 

Lemma 2.3. Let Z = (Z±, . . . , Zt) be a family of independent random variables with Z^ taking values in a 
finite set Bi for each I. Suppose that the real-valued function f defined on Y\ e Bi satisfies 

|/(z)-/(z')|<c, 

whenever the vectors z and z' differ only in the £ th coordinate. Then for any A > 0, 

Pr[|/(Z)-E[/(Z)]| > A] < 2e~ 2A2 /^l 
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For our purpose, a more general martingale inequality is needed. The following version appeared in 

EE]- 

Lemma 2.4. Let X = (Zi, . . . , Zt) be independent identically distributed (i.i.d.) Bernoulli random variables 
with probability p (i.e., Pr[Zi = 1] = p and Pr[Zi = 0] = 1 — p for each i). Suppose that the real-valued 
function f defined on {0, 1}* satisfies 

|/(z)-/(z')|<c ? . 

whenever the vectors z and z' differ only in the i th coordinate. Then for any A, p > 0, 

t 

Pr [\f{Z) - E[f(Z)} | > A] < 2 exp ( - pX + (p 2 /2)p(l - p) £ c? exp(p Ci )) . 

i=l 



3 Coin Weighing Problem 

Suppose n coins are given, some of which are counterfeit. The weights of all authentic coins are the same 
and known a priori, while the weights of counterfeit coins are unknown but different from the weight of an 
authentic coin. Without loss of generality, we may assume that the weights of authentic coins are and 
the weights of counterfeit coins belong to a set of non-zero real numbers. We assume that the number of 
counterfeit coins is known to be at most m. 

If O (to logn) queries are allowed to find counterfeit coins. One may use a randomized binary search: 

Randomized Binary Search Suppose a set A of coins is given, and the number of coins is no more than 
n and there are at most m < n counterfeit coins. Then select each coin with probability 1/2, independently 
of all other coins. Then weigh the set A' of selected coins. If the weight is non-zero, then find a counterfeit 
coin among the selected coins, using the deterministic binary search. 

The deterministic binary search is as follows. Divide A' into two parts A[, A' 2 with size difference at most 1. 
If w(A' 1 ) ^ 0, then select A[. Otherwise, select A' 2 . Keep doing this for the selected set until a counterfeit 
coin is found. 

Provided that there is a counterfeit coin, it is not hard to see that the probability of the weight of A' 
being non-zero is at least 1/2 and the deterministic binary search requires no more than [logn] queries. 
The number of queries required to find one counterfeit coin is at most 2+ [logn] in expectation. Thus, it is 
expected that ([logn] + 2 + o(l))m queries are enough to find all counterfeit coins, with hight probability. 
Here, we show that ( [log n] + 3)m queries are enough, with probability 1 — e -f ^ m ) . 

Lemma 3.1. With probability 1 — e _ ^( m ) ; the randomized binary search finds all counterfeit coins using 
([logn] + 3)m queries. 

The proof of the lemma is presented in Appendix. 

We first construct random sets of coins and then present the algorithm, for which the time complexity 
is not optimized but it is clearly a polynomial time algorithm. Some explanation and analysis of the 
algorithm will follow after the algorithm is presented. The construction of random sets is the same as in 
Choi and Kim [17] . 

Constructing random sets of coins: Let A be a set of n or less coins. For an integer q > 2 and 
l q := [log 9], we construct random subsets Ajj of A, i = 0,1,..., [3 logn], j = l,...,2 £q+l . For i = 0, 
we assign each coin in A a uniform random number among 1, 2", independently of all other coins. 
The set Aqj consists of all coins with assigned number j. Generally, for i = 1,..., [2 log 5] — 1, once 
all j = 1, 2 " +l ~ x , are constructed, we may randomly divide each set j4j-i,j into two parts so 

that coins in j are independently in the first part with probability 1/2. The other coins in Aj-ij 
are to be in the second part. The set of all coins in the first and second parts are denoted by Ai^j-x 
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and Ai 2j, respectively. Or equivalently, after assigning each coin mutually independent random numbers 
r ,r 1 , ---j r r 2 iog«i-i' independently of all other coins, with 

Pr[r = a] = 2~ iq , a = l,...,2 e " and Pr[r t = a] = i, a = 0,1, i = 1, [2 log g] - 1, 

we define i4jj to be the set of all coins with assigned numbers r ,r x , ? r p 2 i ogg i -i satisfying j = 1 + (r — 

l)2*+r 1 2 i - 1 + ---+r. 

For i > [2 log 5], may be deterministically divided into two parts so that the first part consists 

of [| J 4j_i J '|/2] coins. As before, the first part is denoted by A^j-i, and ^4i,2j = \ This 

construction is expected to stop when all Aij, j = 1, ...,2 £m+t , consist of one or no coin. As there are n 
coins, all A%j consist of one or no coin within [log re] more rounds after the random construction ends. For 
the sake of safeness, we stop the construct when i = [3 log re] > [~2 log q] + [log re] . 

The following lemma summarize properties of the random subsets A% j that will be used for the analysis 
of the algorithm presented later. The proof is essentially in |17| and it is presented in Appendix for the 
sake of completeness. 

Lemma 3.2. Suppose a set A of n or less coins are given, and the number of counterfeit coins in A is at 
most q > 2. If the weights w{c) of all but at most q/2 counterfeit coins c satisfy \w(c)\ > a. Then, with 
probability 1 — O(-), we have the followings. 

(a) There are at most ^? counterfeit coins c that satisfy \w(c)\ < a (not exclusive) or belong to a set Aqj 
containing more than one counterfeit coin, j = l,.,.,2r q . 

(b) For each i = 1, [2 log q~\ — 1, Aij contains at most t+2 | og9 counterfeit coins. 

(c) For each i = 1, [21ogg] — 1, there are at most 2~^ l+l " > q + g 3//4 sets Aij that contain more than one 
counterfeit coin. 

(d) Fori > [2 log 9] — 1, each A^j contains one or less counterfeit coin. 

(e) Each ^4.r3i g n ij contains at most one coin. 

Now we are ready to present the algorithm described in Theorem 11.21 

Algorithm (i) (Initially, q = m and A is the set of all re coins.) Construct random subsets A%j of A as 
above with parameter q. Then weigh Aqj for all j = 1, 2 lq , and denote Wqj = w(Aoj) j = 1, 2 lq and 
Jo to be the set of all j such that |u>o,j| > ol. Then go to (ii), where, in general, w(B) = ^cgb w ( c ) f° r a 
set B of coins. 

(ii) (Initially i = 1 and J = Jq.) After relabeling, we may assume J = {1, ...,\J\}. Apply Corollary 12.21 with 
7 4 = max{[log(^)], [log( 3/3(t+ . 21ogg) )] } to A ij2 , A it2 \ j\ and obtain x r satisfying 

w(j4j 2(r+fc)) 

a rk w{A i>2 k) - 2^ • (!) 

fc=l fc=i 

Set, inductively in r = 1, | J|, 

{Wi-i-r if \x r - Y^l=\ a rkU 2 k\ > f 
(2) 
otherwise, 

and u 2r _ 1 = Wi-i :r — u 2r , r = 1, \J\. Go to (hi) if i < [21ogg]. Otherwise, go to (iv). 

(iii) (Initially, s = —2.) Randomly select each j satisfying u. = and j < min{s, 2\J\} with probability 1/2, 
independently of all other j. Weigh L){Aij : selected j}. The weight is if no j is selected. Do this random 
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weighing |~log(i 2 + 1)] +3 times, independently of all other random weighings. This procedure is called a 
random test at s. If the test is passed, i.e., all weights are 0, then update s to be s + 2i 2 . If it is failed 
and s < 2\J\, correct u s by weighing Ai iS , that is, weigh Ai >a , and update u s to be w(Ai )S ) and u s _i to be 
w i-\,s/2 ~ u s- (Note that s is even.) Update also Uj for all j > s according to ([2]) and u 2r _ 1 = u>i-i, r — u 2r . 
If the test is failed and s > 2\J\, then do nothing. Update s to be s — 2 for both cases. This step including 
all updating is to be called a correction step of u s , or simply a correction step, even for s > \J\. It does not 
necessarily mean that u s was not w(Ai tS ) just before the correction step though. 

If s < 2\J\ + 8i 2 logq, repeat (hi) with updated s. Otherwise, let Wij = Uj, j = 1, 2| J\. Then return to 
the original label and update i, J to be i + 1, {j : Wij is defined and \wij\ > a}, respectively, and go to 
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(iv) Set Wij = uj, j = 1, 2\J\. Then return to the original label and update J to be {j : Wij is defined 
and \wij\ > a}. If i < [~31ogn], then go to (ii) after updating i to be i + 1. If i = [~31ogn], then output J 
and declare that all coins in Uj^jAij are counterfeit. Remove all coins that are declared counterfeit from 
the set A of all coins and update q to be 5q/6. If q > m. 0,8 + 2em go to (i). Otherwise, go to (v). 

(v) Apply the randomized binary search to find counterfeit coins one by one, using ([logn] +3)(m 0,8 + 2em) 
queries. 

The core parts of the algorithm are (ii) and (hi). If Wi—\ j = w(Ai—\j) and every set contains 

at most one coin, then w(Ai ) 2j) = or Provided | Ylk=i ^^^-J^^ I is small enough, say less than 

a/2 (see (a) lemma l33|) . it is not hard to show that U2 r = w(Ai^ r ) and U2 r -i = w{Ai^ T -i) for all r. (See 
Corollary 13.51 ) This was one of main ideas of Bshouty and Mazzawi [9]. In general, as some sets ^4i-i,j 
contain more than one counterfeit coin, ii2 r may or may not be w{Ai^2r)- 

If r is the smallest r with «2r 7^ ^(Ai^r), u 2r > = w^Ai^r'), r' > r, is not guaranteed any more even if 
the set Ai_iy contains only one counterfeit coin. This is why we introduced random tests and correction 
steps in (hi). The random tests generate a random walk that travels according to the value of s. It turns 
out that the walk goes forward until it passes or at 2r. Once the random walk passes or at 2r, it goes 
backward with a probability close to 1 (not extremely close to 1 though). It is expected that the random 
walk with correction steps quickly identifies and corrects u 2r . 

Moreover, it turns out that r is the smallest r with U2 r ^ w{Ai^ r ) only if ^4j_i >r contains more than 
one counterfeit coin. If not many sets Aj_i jr contain more than one counterfeit coin (see (c) of Lemma [3 .2p . 
the number of queries asked to identify and correct corresponding U2r's seems to be reasonably small. In 
other words, the lesser the number of sets Aj_i iT . containing more than one counterfeit coin is, the faster s 
increases. Eventually, s keeps increasing after all corresponding «2r's are corrected. 

Remark, (a) Though the initial value —2 of s looks somewhat strange, it is natural as s = 2r — 2 when 
U2r is corrected and the initial value must be determined as if the imaginary uq were corrected. 

(b) When the random test fails, it may be tempting to find Ai^ r with w(Ai t 2 r ) ^ 0, say using a binary 
search. However, the number of queries needed to find such a set can be as large as O(logq), while our 
algorithm is expected to correct U2 T using 0(i 2 log(z 2 + 1)) queries. This save queries when i is small. 
Though the bound is not extremely good if i is large, it is not really a matter as there are much less sets 
^4j_i.j containing more than one counterfeit coin. (See (c) of Lemma 13.21 ) 

To analyze the algorithm, we precisely summarize core properties of the algorithm. 

Lemma 3.3. Suppose (a)-(e) of Lemma \3. 2\ hold for q and Wi-ij = w(Ai^ij) for a fixed i = 1, [3 log n] 
and all j = 1, \ J\. Then we have the fallowings. 

(a) For all r = 1, \J\, \ £^1 < I ■ 

(b) Ifr is the smallest r such that u 2r ^ w{Ai^ r ) when U2> r is first defined or updated, then neither w{Ai^ r -i) 
nor w(Ai t 2r) is zero, especially Ai-i >r contains more than one counterfeit coin. 
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(c) Suppose i < [~21ogg] and it. = w(Aij) for all j < 2r — 2 at a step. If s < 2r — 2 at the step, then s 
keeps increasing until s > 2r. And once s > 2r, s > 2r at all the following steps except possibly one step, 
which is a correction step of u 2r and s = 2r — 2. 

Proof, (a) For i < [21ogg], since i < 21ogg, 2 H > W+|!2M) and \ w (A h2{r+k) )\ < Mi±|Msl (as A .^ r+k) 
contains at most t+2 * ogg counterfeit coins by (b) of Lemma l3.2[) . we have 

I w{A it2 (r + k)) I y> fi(i + 21og q) < a a y> / 2alogg \k a " (]\ k « 
| 2-/ 2 fc T. I ~ f-f T ; f Mi+iigsg) ^ ~33 f-j 1 12/3 log g/ "3 3^16/ 2' 



fe=l fc=l H ia / fc=l fc=l 

If i > [2 log q\ , then each Aij contains one or less counterfeit coin by (d) of Lemma 13, 2\ which together 



with 2^ > ^ gives 



k r ( * \ k r n oo , oc , , 

E MA,2(r+t)) I p a a^/Q!\ fc Q a^^lxfc u 

2^ft I - 2^ 7e|u - 6 6 ^ V6pV ~6 6^ 
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< 



(b) As r is the smallest r such that u 2r ^ w{Ai i 2 r ) when u 2r is defined or updated, U2j = w(Ai t 2j) for all 
j < r and hence 

( a \ C /l \ w \ A i,2{r+k)) ^(^j,2(r+fc)J 

t«(A2r) = S P " 2^ a rM A i,2k) ~ 2^ ^ = X r - 2^ ~ ^ ^ ' 

k=l k=l k=l k=l 

If u 2r = 0, u 2r 7^ w(Aj i 2r) yields that w(Ai ) 2r) 0- On the other hand, u 2r = implies that \x r — 
Yfk=l a rk u 2k I < a /^- This together with (a) gives that (u^i^r)! < ct. Since |u>(^4j_i ir )| = \wi-ij\ > a 
and w(A^ 2r -i) = w(Ai-x,r) ~ w(A,2r), w(Ai j 2r-i) ^ °- If «2r = ^i-l,r(= ^(^i-l,r)), then tij r 7^ tL'(yli i 2r) 
yields that u^yl^r) 7^ ^(A-i,r) and hence w^^r-i) = w(^4j-i,r) — ^(^4j,2r) 7^ 0. On the other hand, 
^ = Wi-i jr implies that \x r — Ylk=i a rk u 2k I — a /^- This together with (a) gives that |tf(^4i,2r)| > 0, i.e., 
w(A i>2r ) + 

(c) We prove this by reverse induction. For r = \J\ + 1, if Uj = w(Aij) for all j < 2r — 2 = 2\J\, then 
w(Aij) = whenever Uj = 0, for all j < 2\J\. Thus, the random test must be passed and s keeps increasing 
regardless of the value of s (as no u. is updated). Suppose Uj = w(Aij) for all j < 2r — 2 with r < \J\. 
Then w(Aij) = for all j < 2r — 2 with Uj = 0. If s < 2r — 2, the random test must be passed and hence 
s increases. Once s > 2r — 2, or equivalently s > 2r (as s is even), no uj with j < 2r — 2 is updated before 
a correction step of itj r . If there is no correction step of v^ r , then s > 2r at all the following steps. If u 2r is 
corrected at a step, then s = 2r — 2 and u,- = w(Aij) for all j < 2r at the step. The induction hypothesis 
especially yields s > 2r at all steps after the correction step. 

□ 

To analyze (iii) of the algorithm for a fixed i < \2 log q] , we may regard the whole process as a random 
walk S that travels according to the value of s. That is, S = (s , s 1 , ...), where s k is the value of s at the end 
of the k th step. Note that S goes backward, i.e., s decreases, at a step if and only if the step is a correction 
step. We will see that S goes forward until it passes or at 2r for the the smallest r with u 2r 7^ w(Ai^r), and 
then S tends to go backward with probabilities close (not extremely though) to 1 until u 2r is corrected. 

We partition 5* into a few subrandom walks that are essentially independent identically distributed 
(i.i.d). They are not exactly i.i.d though. Let r = 0. The th (sub)random walk Sq (of S) starts when 
the whole process starts and ends at the same time, that is, Sq = (s ) (recall s = —2). Let r x be the the 
smallest r with u 2r 7^ w^Ai 2 r ). The first random walk S\ starts immediately after Sq ends and it ends when 
s = 2r 1 — 2 at a backward step or s > 2\J\ + 8i 2 logg for the first time, whichever comes first. Generally, 
for £ > 2, if ends with s = 2r e _ 1 — 2, then let r e be the smallest r < | J\ such that u 2r 7^ w(Ai t 2 r ) at the 
step Sg-i ends. The I th random walk Se starts immediately after <S>£_i ends, and it ends when s = 2r e — 2 
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at a backward step or s > 2| J\ + 8i 2 log q for the firs time, whichever comes first. However, Si does not end 
at a forward step with s = 2r e — 2. In theory, it is possible that Si is infinite, though it is not difficult to 
show that the probability of Si being infinite is 0. Both of r , and Si' are not defined for all £' > £, if Si-\ 
is infinite or ends with s > 2\J\ + 8i 2 logq, or u 2r = w{Ai^r) for all r < \ J\ at the last step of Sp-\. 

The random walk Si is called good if it is defined and ends with s = 2r e — 2. Note that the last step 
of a good random walk Si is the first correction step of u 2r after Si starts. In other words, a good random 
walk Si ends when it corrects u 2r where ri is the smallest r such that u 2r ^ w{Ai^ r ) when it starts. We 
also note that r e , Si are defined only if Se-i is good. In particular, Si is good only if Si' is good for all 
£' < £ - 1. 

Corollary 3.4. Under the same hypotheses as in Lemma \3.3\ with i < \2\ogq~\ — 1, we have the fallowings. 

(a) For £ > 1, if Si is good, uj = w(Aij) for all 1 < j < 2r e at the last step of Si, especially r e+1 > r e if 
r e+1 is defined. Furthermore, a good random walk Si starts with s = 2r l _ x — 2 and keeps going forward until 
s > 2r e , and then goes back and farce with s > 2r e at all steps except the last step at which s = 2r e —2. 

(b) If r e is defined, then neither w(Ai j 2r e -i) nor w(Ai ! 2r e ) is zero and contains more than one coun- 
terfeit coin. In particular, r t and Si are not defined if £ > h q := [~2~(* +1 )g + q 3 ^] . 

(c) Suppose every Si is good if defined. Then Wij = w(Aij), for all j = 1, 2\J\. 

Proof, (a) For £ > 1, suppose Uj = w(Aij) for all j < 2r e _ 1 at the last step of Si^\. (This is trivial when 
£ = 1.) Since Si is good only if Si^\ is good, the induction hypothesis may be applied to obtain r e > r t _ x 
and hence 

Uj = w(Aij) for all j <2r t —2 

at the last step of Si^\. Then, (c) of Lemma 13.31 gives that s keeps increasing (without updating it,) after 
the last step of Si-\, at which s = 2r i _ 1 — 2, until s > 2r e . Once s > 2r e , no Uj with j < 2r e — 2 is updated 
before the last step of Si. Since Si is good, u^ r is corrected and hence ui r = w{Ai^ T ), U2r-i = w(Ai^r-i) 
at the last step of Si. The second part is already shown too. 

(b) Since r t > r t _ x is defined, s = r t _ x — 2 at the last step of Si-\ and u 2r is updated at the last step of 
Si-i. By (b) of Lemma 13. 3t neither w{Ai^ r -i) nor w{Ai^ r ) is zero. The second part follows from that all 
r f 's are distinct and there are at most h q sets ^4j-i jr containing more than one counterfeit coin (see (c) of 
Lemma 13. 2|) . 

(c) For the largest £ for which r e is defined, as Si is good and r e+1 is not defined, u 2r = w{Ai^ r ) for 
all r = 1,..., |J| at the last step of Si. Since s keeps increasing after the last step and eventually s > 
2| J|+8i 2 log q without updating u j 's, we have u^r = w{Ai^r) for r = 1, | J|, and Wi^r-i = Wi-i,r~ Wi,2r = 

w(A-l,r) - w(A it 2r) = w(A it2r -l)- 

□ 

If [21ogg] <i< [31ogn], each Aij contains at most one counterfeit coin by (d) of Lemma 13.21 Then 
it easily follows that Wij = w(Aij) for all j = 1, ...,2|J|. 

Corollary 3.5. Under the same hypotheses as in Lemma HOI with [~21ogg] <i< [~31ogra], Wij = w(Aij) 
for all j = 1, 2[ J| . 

Proof. Recall that Wij = u., where u. 's are defined in (ii). Take, if any, the smallest r such that u 2r ^ 
w(Ai t 2r)- Then, (b) of Lemma 13.31 implies that Ai_i <r contains more than one counterfeit coin, which 
is not possible as each Aj_i iT . contains at most one counterfeit coin due to (d) of Lemma 13.21 Hence, 
w it 2r = w(A iy 2r) for all r = 1, | J\ and w it 2 r -i = u 2r _ 1 = w(Ai^ r ) - w(A it2r ) = w(A it2r -i)- 

□ 

Corollaries 13.41 and 13.51 provide all but one basic properties to analyze the algorithm. The missing 
property is that, with high probability, every Si is good if defined, the hypothesis of (c) of Corollary 13.41 
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For the query complexity, an upper bound for the number |«S^| of steps in Si is needed. As we want to 
bound \S(\ only for good Si, we will consider |<S^|Xf > where 



Xi 



1 if Si is good 
otherwise. 



It will be first shown that, after Se passes or at 2r e , the random walk goes to backward with probability 
at least 1 — gTp^Tn until u 2r is corrected, which follows from w(U{Ai t j : selected j}) ^ with probability 
at least 1/2 during the process. Thus, Si goes backward by at least 7/4 in expectation after Si passes or 
at 2r e , as Si goes forward by 2i 2 with probability at most gTj^fj] an< ^ S oes backward by 2 otherwise. This 
is why Si is expected to be good. The number Fi of forward steps in Si after it passes or at 2r e is also 
expected to be reasonably small, namely 0(1) with a probability close to 1. It actually turns out that the 
probability of Fi = k is at most e~ k+1 and the sum Y^i=i Q may be bounded by 0(h q ) with high enough 
probability, say with probability 1 — e _n ^ 3 A \ where h q = \2~^ i+1 ^q + g 3 / 4 J as in (b) of Lemma 13.41 

Then, it is not difficult to show that the number of all steps in Si after it passes or at 2r e is 0(i 2 Fi), 
especially there are 0(i 2 Fi) backward steps in Si by (a) of Corollary 13.41 Therefore, there are 0{i 2 h q ) 
backward steps in S with high probability. All other steps in S are forward steps and hence there are 

o{fhq + m + ^ + = o( B + (j > + „(_«_ + ,3/, 

steps in S. As 0(log(i 2 + 1)) queries are asked at each step, the number of queries asked in the i th round, 
i = 1, [2 log q] - 1, is o(q(jz + |H log(i 2 + 1)) assuming \ J\ < q. 

The precise statements are presented in the next lemma. Though idea is simple as illustrated above, 
our proof of the lemma is somewhat lengthy, partly because it is proven rigorously without referring other 
theories. We prove the lemma at the end of this section. Readers familiar with random walks may skip the 
proof. 

Lemma 3.6. Under the same hypotheses as in Lemma \3.3\ with i < [~21ogg] — 1, if U2 r ^ w{Ai^ r ) and 
s > 2r at a step, then the probability that s increases at the next step is at most 8 ^a +1 ^ ■ Moreover, every 
Si is good if defined, with probability 1 — 0(q~ 3 ), and the number \S\ of all steps satisfies 

I J I 



Pr 



!S|>^+4(* 2 + 2)(^ + S 3/4 )]=0( 9 



Correctness of the algorithm Once Lemmas 13.21 13.61 and Corollaries 13.41 13.51 are established, it is easy 
to see that the algorithm finds counterfeit coins as desired. In the next lemma, we precisely describe it 
along with a property needed to bound query complexity. 

Lemma 3.7. For a fixed q > m 0,8 + 2em, the followings hold with probability 1 — 0(l/q), assuming the 
same in the prior round. 

(a) The statements (a)-(e) of Lemma \3.2\ hold. 

(b) Whenever Wij is defined, Wij = w(Aij). In particular, a coin declared to be counterfeit must be 
counterfeit. 

(c) The algorithm finds every counterfeit coin c with \w{c)\ > a that is a unique counterfeit coin of Aq^ for 
some i = 1, 2 lq . And the number of remaining counterfeit coins is at most the updated q. 

(d) The number of queries asked in all rounds of (Hi) is 0(q), where the constant in 0(q) is at most 

V oo 5+log(» 2 + l) . (» 2 +2)(5+log(^ 2 +l)) m 

Z^=i p 1 h 
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Proof. As q > 2em and (c) holds in the prior round, Lemma [3.21 yields that the statements in (a) hold with 
probability 1 — 0(l/q). We assume the statements to prove the other properties. 

To prove the other properties, we further assume that every Si is good if defined and that, for each 
i = 1, [2 log q] — 1 and the number |5| of all steps in the z th round of (iii), 

!5|<^ + 4(, 2 + 2)(^ T + ^), (3) 

both of which hold with probability 1 — 0(q~ 3 ) by Lemma 13.61 Then the first part of (b) follows from 
(c) of Corollary 13.41 and Corollary 13.51 Since every A^ 3 i ogn ^j consists of one or no coin, each coin c in 
UjgjA|- 3 i og n -| j satisfies \w(c)\ = \w(A^ 3 i ogn ^ j)\ = |w|- 31og „-| j\ > a for some j G J, especially, c is counterfeit. 

If a coin c with |w(c)| > a is a unique counterfeit coin in Ao t £ , then, for each i = 0,..., |~31ogn], 
there is a unique £i such that An. C Aq^ contains c. It is clear, by the way how Aij's are constructed, 
that Aiit Q Aj—i^j for all i = 1,..., |~31ogn~|. Moreover, since c is a unique counterfeit coin of Ai^, 
1 10(^^)1 > a for all i = 0, [~31ogn~|. For i = 0, 1$ G J as |u>o,£ | = |iu(.Ao,£o)l — a - ^or * — 1> assuming 
^i-i € J when the prior round ends, = w(Ai ! £ i ) by (c) of Corollary [331 as £ <^ an d ^ 
Thus, |u)j^J = > a implies that li is in the updated J. We have just shown that ^ £ J when 

the z th round ends for each i, especially, for i = [31ogn]. As c € Ai^ i for i = [31ogn], c is declared to be 
counterfeit. The second part of (c) follows from (a) of Lemma 13.21 

Note that \J\ = \{j : Wij is defined and \wi t j\ > a}\ < q by the second part of (c) in the prior round 
and first part of (b), as \wij\ = \w(Aij)\ > a implies that A^j contains a counterfeit coin and the number of 
such sets is at most the number of counterfeit coins. Since the algorithm asks at most 5 + log(i 2 + 1) queries 
at each step of S (one more query is needed in backward steps), ([3]) yields that the number of queries is at 
most 

^f- 1 ( P + Mf + '»9 + 4(i , + 2)(5 + log(i , + + _ 0W) , 

i=l 

□ 



The lemma especially says that the number of remaining counterfeit coins decreases by factor 5/6, 
with probability 1 — 0(1/ q). Applying this inductively until q < m ' 8 + 2em, we know the algorithm find 
all but at most m ' 8 + 2em counterfeit coins before it goes to (v), with probability 1 — O(l/m ' 8 ). All the 



remaining m + 2em counterfeit coins are found in (v), with probability 1 — e 



-n(m° 



\ by Lemma 13.11 



Corollary 3.8. The algorithm find all but at most m ' 8 + 2em counterfeit coins before it goes to (v), with 
probability 1 — O(l/m - 8 ). All the remaining m a8 + 2em counterfeit coins are found in (v), with probability 



1 



-Q(m as ) 



, by Lemma \3A 



Query Complexity Suppose (a)-(d) of Lemma 13.71 hold for all q, which occurs with probability 1 — 
O(l/m ' 8 ). Then for each q, the number of remaining counterfeit coins is at most q. Especially, \ J\ < q as 
seen in last paragraph of the proof of Lemma 13.71 For each q, 2 iq < 2q queries are needed in (i). For each 
q and i, the number of queries asked in (ii) is 



(2 + (l)) 7 JJ| 
log(7j^l) 



< < 



Since \J\ < q and 



(2+o(l))|J| 
log|J| 

(2+o(l))|J| 
log |^| 



log( 



3/3(i+21ogg)- 



log(6/3/a) 



) [fi<\2logq] 



if i > \2 log q] . 



f21og 9 ]-l 



3/3(i + 21ogg). 



i a 



< 4 log glog(3/3/a) + log 



2r21ogg] - 1 
f2 log (/] - 1 



< 4 log q log(3/3/a) + 4 log q + 1, 
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and 

[3 log n] 

[Tog(60/a)l < 31og(6/3/a)logn + 31ogn 

i=\2logq] 

for each g, the number of queries asked in (ii) is ^ 9 log ^^ log - ^ • 

The number of all queries asked in (iii) for each q is 0(g) by (d) of Lemma 13.71 No query is asked in 
(iv) and hence the total number of queries asked for fixed q > m ' 8 + 2em is O ( 9 log ^g"g log - J • As g keeps 

decreasing by factor of 5/6, the number of queries asked before the algorithm goes to (v) is ( ml ° g (f/^ log " 



□ 

This together with Corollary 13.81 implies that, if we artificially stop the algorithm when it asks 
7?mlog(/3/a) logn q nei ^ es ^ f or ^ e constant r] in the Q| ml °g(/ 3 / ct ) lo § n j term, all but at most to ' 8 + 2em coun- 



terfeit coins are found with probability 1 — O(l/m ' 8 ). As ([Togra] + 3)(m ' 8 + 2em) queries are asked in 
(v) of the algorithm, Theorem 11.21 follows. We conclude this section by proving Lemma 13.61 

Proof of Lemma 13.61 Note that each u<i r may have one of three values 0, i0j_i jr , w{Ai^ r )- Since ii2 r 7^ 
w{Ai^r)-, U2r is either or Wj-ij. If ui r = 0, then w(Ai^ r ) / 0. If ui r = Wi-ij(= w(Ai^ij)), then 
U2r~i = while w(Ai t 2r) 7^ U2r = w{Ai-\ : j) yields w(Aip r -i) = w(Ai-ij) — w(Ai t 2 r ) 7^ 0. Particularly, 
there is £ < s such that w(Ai g) 7^ while ug = 0. Suppose the random selection other than £ is carried out. 
Then the set of coins to be weighed is either U{-Ay : selected j,j 7^ £} or \j{A^j : selected j,j 7^ £} U Aj^, 
each of which occurs with probability 1/2. Since w(Ai t g) 7^ implies that the weights of the two sets are 
different, 

Pv[w(U{A i:j : selected j}) = 0] < 1/2. 

After independently performing this [~log(i 2 + 1)] +3 times, the probability that all weights are is at most 
2TiogO +1)1+3 < 8 ^} +l ^ . That is, s increases at the next step with probability at most greq?n • 

For the second part, suppose Sg is defined but it is not good, which especially means that Sg-\ is 
good. Then Sg must be infinite or reach a step with s > 2| J\ + 8i 2 logg. As Sg starts with s = 2r._ 1 — 2, 
r^_ 1 < r e , and ii2r = w{A^2r) fo r all r < 2r £ — 2, the random walk Sg keeps going forward until s > 2r £ by 
(c) of Lemma 13.31 Let a t be the value of s when Sg reaches a step with s > 2r e for the first time. Then 

2r e <a e < 2r e + 2i 2 - 2, or < aj2 - r e < i 2 - 1, (4) 

for s increases by 2i 2 . Hence, there must be at least |_41oggJ more forwarding steps to reach a step with 
s > 2\J\ + 8i 2 logg, as, otherwise, 

s < a t + 2i 2 (L41oggj - 1) < 2r e - 2 + 2i 2 + 2i 2 ([41og gj - 1) < 2[ J| + 8i 2 log q. 

If 5^ is infinite, there must be at least [4 log gj more forwarding steps too. 

Counting after Sg reaches a step with s > 2r t for the first time, let T be the number of steps in Sg 
until there are |_41oggJ more forwarding steps. For Sg is not good, there is no correction step of u 2r , or 
equivalently s > 2r e after the count starts, particularly, T satisfies 

<r t + 2i 2 [4 log gj - 2(T - [4 log gj ) > 2r t , 

which, together with (jU, gives 

T< (i 2 + l)L41oggj +aj2-r t < (i 2 + l)(L41oggJ + 1). 
We have just shown that, for t = (i 2 + 1) ( [4 log gj + 1), 

Pr[tS^ is not good] < Pr 3 |_41oggJ forward steps among the first t or less steps of Sg . (5) 
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To bound the last probability, it is convenient to introduce an auxiliary random walk <S?. The infinite 
random walk S£ starts when Si reaches a step with s > 2r t for the first time and it is the same as Si until 
Si ends. Once Si ends, S% keeps going forward by 2i 2 with probability 8 ^i +1 ^ and backward by 2 with 

probability 1 — 8 ^ 1 +1 ^ . Then, at any step, S$ goes forward with probability at most gT^qrn ■ 

As there are \A log q\ forward steps among the first t steps of <S| if there are [4 log q\ forward steps 
among the first t or less steps of Si, © gives 

Pr[<!>£ is not good] < Pr 3 |_41og(/J forward steps among the first t steps of 5| 

|41og ? J 



which is at most ( L41 * gg j) ( 8(7^+1 
Pr[5^ is not good] < 



t 



Therefore, using (*) < (f ) k , 
1 



L4io g(? J / e(i 2 + l 41ogg +1 

< exp 4 log q In v ; _ AL ,, , ' 

~ y \ L syj 8(i 2 + 1)L4 log qJ 



L4io g e jy V8(i 2 + 1) 

Using ln(e/8) < —1 and ln(l + y) < y for y > 0, we obtain 

Pr[Si is not good] < exp ( — L41oggJ + 1) = 0{q~ A ) 



Since Si is defined for at most h q indices i by (b) of Corollary [331 an d h q = 0(q), Boole's inequality yields 
the desired bound. 

For the last bound, if Si is good, let F% be the number of all forward steps in Si after Si reaches a step 
with s > 2r e for the first time. If Si is not good or not defined, then Fi = 0. If Fi = k > 1, then must be 
good and, for the number t of all steps in Si after Si reaches a step with s > 2r f for the first time, we have 

<r t + 2i 2 k - 2(t - k) = 2r e - 2 or t = {i 2 + + aj2 - r e + 1 < (i 2 + l)(Jfe + 1), 

(recall that er £ is the value of s when Si reaches a step with s > 2r e for the first time). 

After Si reaches a step with s > 2r e for the first time, the probability that Si moves forward is at most 
gTp^Tn un t u it ends. Moreover, the bound for the probability holds regardless of F^/, £' < I. The same 
argument as above gives, for a positive integer k, 

~Pi[Fi = k\Fi, < Pr 3 k forward steps among the first t steps of 5? 
and, by (*) < (f ) fc , ln(e/8) < -1 and ln(l + y) < y for y > 0, 



Pr [ Fi = k\F 1 ,...,F i _ l] <Q(^^y<e W (kln 
The inequality still holds when k = 0. For h = h q = |_2~( l+1 )g + g 3 / 4 J , 



ef 



8/t(i 2 + 1; 



< e 



-fe+i 



Pr 

implies that 



F 1 = k 1} ...,F h = k, 



•]=iH 



Pr 



n 

[E 



Ft = k 



E 



F, 



Pr 



< e -(Eti **)+>> 



< 



fe + h 



-k+h 



Since ( fe + ft ) < (^i) h , we have 
ft 



Pr[$> 

€=1 



< exp ( /i In C ^ — - — k + h) = exp ( h In 



+ /») 
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For k > Ah - 1, hln - k + 2h < -Ak/h + 3h yields that 



h 

[E 



Pr I 2^ if> > Ah - 1 



CO _ /l CO 



fc=4/i-l £=1 k=4h-l 

Finally, for good the number of forward steps in Si is 

^-(2^-1-2) , „ r e ~ r i-i , <T t /2-T t + l 



2i 2 



+ F t 



+ 



while the number backward steps in 5^ is, 

~ (* t - (2r t - 2) + 2i 2 F,) =a e /2-r e + l + i 2 F e . 

As <jJ2 - r e < i 2 - 1 by Q, 



\ s t\Xt < 



r. - r. 



1 ^ i 2 



± + l + F e + i 2 + i 2 F i = Tl I 1 - 1 + {i 2 + l)(F t + 1). 



Therefore, 



PrJ^l^lxi > ^+A(i 2 + l)h 



<Pr[j^F/>4/i-l 

£=1 



< 2e 



-(A-4)/5 < 2e -9 3/4 /5+l 



where r* = max{r, : is good}. 

Suppose every SV is good if defined. Then there are |~ 2 l J l~( 2r ~ 2 } +Sl log<? ] more steps after u 2jlt is 
corrected, and the number |<S| of all steps in S, or equivalently in (iii) for fixed i, is 



\J\ - r* + 1 + Ai 2 logg 



< 



\J\-r* + 1 



+ 41ogg + l + ^|S £ | % . 



Thus, if Eti \Si\x < r i + A(i 2 + l)h, then 



1^1 



\S\ < f + 4(^ 2 + 1)* + ^ + 4 log g + 1< H + 4(* 2 + 2) (-^ + g 3 / 4 

By the contrapositive, if |<S| > ^ + 4(i 2 + 2) ^^tt + 9 3//4 ) > then either there is <S^ that is defined but 
not good or 



El%> %+4(i 2 + l)fc, 



^=1 



which gives 



Pr 



|5|>^ + 4(i a + 2)(^ r+ ^ 



0(g" 3 + e 



0(g~ 



□ 
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4 Finding Weighted Graphs 



In this section, we present a randomized algorithm finding weighted graphs using additive queries, where 
an additive query asks the sum of weights of edges with both ends in a fixed set. The algorithm uses coin 
weighing algorithms presented in the previous section. 

Let G = (V, E, wq) be a weighted graph with WQ{e) ^ for all e E E. We just say graphs for weighted 
graphs. First of all, it is enough to consider bipartite graphs: For general graphs, one may consider two 
disjoint copies X, Y ofV. The copy of u £ V in X and the copy of v 6 V in Y form an edge if and only if uv 
is an edge in G, and, of course, the weight is inherited. Then a query of type w(A, B) := J2xeA y&B w i x i v)-> 
A <Z X,B <ZY \s a, linear combination of four additive queries in G, that is, 

w(A,B) = w G (AuB)-w G (A\B)-w G (B\A) + w G (AnB). (6) 

In the rest of this section, we consider weighted bipartite graphs G = (X U Y, E, w) with \X\ = \Y\ = n 
and \E\ < m. A query means that one takes two sets A C X and B C Y and finds out w(A,B) := 
EaeA,eB w(a,b). 

If O(mlogn) queries are allowed, it is easy to find the graph using the randomized binary search: 

Randomized Binary Search for Graph Suppose n, m > 1 and a bipartite graph G = X L)Y with at 
most m edges and \X\, \Y\ < n is given. Then, take random subsets X',Y' of X and Y, respectively, so 
that each vertex x G X (y £ Y, resp.) in X' (Y', resp.) with probability 1/2, independently of all other 
vertices. If w(X',Y') ^ 0, find an edge there using the deterministic binary search. Otherwise, take a new 
random sets X',Y' and do it again. Stop when (2|~logn] + 5)m queries are asked. Output all edges found. 

The deterministic binary search means that divide X' into two parts X'^X^ with size difference at most 1. 
If w(X[,Y') ^ take X[, otherwise, take X' 2 . Keep doing this until a vertex x with w(x,Y') ^ is found. 
Then, find y €.Y' with w(x, y) ^ using the same method. 

If there is an edge in G, the probability of w(X',Y') ^ is at least 1/4. It may be shown that 
(2[logn] + 4 + o(l))m queries are enough to find all edges in G, with high probability. We may prove 
(2 [log n\ + 5)m queries are enough with probability 1 — e~^ m \ a proof of which is presented in Appendix. 

Lemma 4.1. The randomized binary search finds all edges of G with probability 1 — e ~^( m ). 

For a better query complexity, a more sophisticated algorithm is needed. We first present an algorithm 
finding all edges of G when the maximum degree of G is small, say at most to 0,1 . Then another algorithm 
is introduced to find vertices of large degree and edges containing them. Concatenating two algorithms, the 
following theorem may be shown. 

Theorem 4.2. Let n,m be positive integers with n 2 > m > 2 and let a, ft > be positive real numbers 
(not necessarily constants) with 2a < (3. Suppose a bipartite (weighted) graph G is given such that each 
part of G has at most n vertices and there are m or less edges in G. If the weights w(e) of edges satisfy 
a < |if(e)| < j3, then there is a randomized polynomial time algorithm that asks 0{ m log [fg ^ lQg ~ ) <7 uer * es ; 
and finds all edges with probability 1 — O(l/m ' 02 ). 

Theorem 11.11 follows from the theorem and ([6]) . 

For the first algorithm, let 5 = 0.05 and assume that the maximum degree of G is less than m 2S . To 
present the algorithm, construct a random partition X\, X m i/2+2s of X so that each vertex x E X is 
equally likely in Xj, j = 1, m 1//2+2<5 , independently of all other vertices. Similarly, construct a random 
partition Yi, 1^1/2+2,5 of Y. 

Lemma 4.3. Under the same hypotheses as in Theorem ^. £\ if the maximum degree of G is less than m 2S , 
then, with probability 1 — (1 + o(l))m~ s , the followings hold. 

(a) For each i = 1, m l / 2+25 , \N{XA\ < 2m 1 / 2 " 2 ' 5 , where N(Xi) := {y G Y : y ~ x for some x G XJ. 
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(b) For each i = 1, ...,m 1 / 2+2<5 and y EY, d(y;Xi) < 3, where d(y;Xi) := {x € Xj : a; ~ y}. 
fc^ For eac/i i = 1, m, 1//2+2<5 , £/ie number of vertices y E Y with d(y;Xi) > 2 is at most m 55 . 
fdj TTie statements (a)-(c) hold after the roles of X andY are switched. 

(e) Except for 3m 1 ~ 3S edges, every edge is a unique edge between X{ and Yj for some pair i,j. 

Proof. Let p = m -1 / 2-25 . Then, Pr[x € Xi] = p for all x and i. It is enough to show that (a)-(c) hold with 
probability 1 — o(m~ s ) and (e) holds with probability 1 — m~ s . 
For (a), as 

< S ^^pd(y) < pm = m 1 / 2-2 * 5 , 
y& 



E[\N(X i )\) = Y,{l-V*[XinN(y) 



£0-(i-p 



the generalized martingale inequality (Lemma 12. 4|) with p = m 1 / 2 2S , c x = d(x), A = m 1 / 2 2S , and 

-25 



p = m /2, gives that 



Pr 



\N H {Xi)\ >2m l l 2 - 25 



< 2exp 



m l/2-45 m -l/2-M 
^ 1 ^ 



2 e m~ 2,5 (i(a;)/2 



Since e m 2 * d M/2 < e 1 / 2 < 2 and Exex( d ( x )) 2 ^ ™ 2<5 E* e x = m l+2S , we have 

,1/2-45, 



Pr 



\N(Xi)\ > 2m 1 / 2 - 25 



< 2exp 



m 



and 



Pr 



4 

m l/2- 1* 



3i s.t. |iV(X,)| > 2m 1 / 2 - 25 ] < 2m 1 / 2+25 exp ( - = o{m- & ). 



d(y) 

4 



4< W 

P - 24 



For (b), 

Pr[d(y;Xi) > 4] < 

Thus, the probability that there is a pair y,j such that d(y,Xj) > 4 is at most 

m l/2+2<5 



V- \ - (p^(y)) 4 p 4 m 1 / 2 + 2 V65 

2^ 04 ~ 94 Z^ d ^) 



m -2-85 m l/2+2<5 Tr) l+6<5 



j=l y& 



< 



m 



y& 



24 



24m 



71 =o{m ). 



For (c), suppose the number Zj of vertices y € Y with d(y,Xi) > 2 is more than m 5S . Then there 
are distinct vertices y\, —,y m s in Y with d(yj,X{) > 2, j = 1, ...,m s , such that N(yj) n N(yk) = for all 
distinct pairs j, = 1, m* 5 . This is possible since each fixed j/6F satisfies N(y) D N(y') ^ for at most 



m 
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1 vertices y' € y. As r! > (|) r and (d(yj)) 2 < m 2d a%), 



yi,-,y m 6 j=i 



? 95 a m 9 A i5 

ep z m za \ m x - t t , /ep z m m\ m 



and 



Pr[3i s.t. > m 5<5 ] < m 1 /^ 25 



2m 



35 



o m 



For (e), the probability that an edge e = xy is not a unique edge between any pair of X{ and Yj is 

m l/2+26 

Pr[(x, y) £ Xi x Yj] Pr 3 edge between and Yj other than e (x, y) € Xj x Yj 
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Since the conditional probability is at most 

(d(x) - l)p + (d(y) - l)p + (m - d{x) - d(y) + l)p 2 < 2m 2S p + mp 2 < 3m~ 4S 

and YlTj=i P r [( x > y) & Xi xYj] = 1, the number W of edges that are not a unique edge between any pair 
of Xi and Yj is at most 3m 1_4<5 in expectation. Markov inequality implies that 

Pv[W > 3m 1 - 36 ] < m' S . 

□ 

The next algorithm finds all edges of G when the maximum degree of G is less than m 2S . 

Algorithm A (i) For each i, i = 1, m 1 / 2+2<5 , regarding each y G Y as a coin with weight Wi(y) := 
wc(Xi,y) = Y^xeXi W G( x ,y), apply the coin weighing algorithm in Corollary 11.31 to find all counterfeit 
coins with parameters (m,n,a, (3,e, fi) replaced by (2m 1//2 ~ 25 , n, a, 3/3, m~ 1//2+75 , j^g). Let iVo(Xi) be the 
set of all counterfeit coins found, i = 1, m 1 / 2 " 1 " 25 . Do the same for Yj and let Xo(Yj) be the set of all 
counterfeit coins found, j = 1, m 1 / 2+2<5 . 

(ii) For all pairs i,j = 1, ...,m l / 2+25 with |X (Xj) n Yj\ = \X t D 2Vb(l/)l = 1> take 2/ e N (Xi) D Y,- and 
x G n iVo(l^) and weigh the possible edge xy to obtain wc(x,y). For each pair xy with wc(x,y) ^ 0, 
declare that xy is an edge of G 

(iii) Find remaining edges one by one by applying the randomized binary search using no more than 
(6[~logn] + 15)m 1_3<5 queries. 

For the collectedness and the query complexity of the algorithm, we prove the following lemma. 

Lemma 4.4. Under the same hypotheses as in Theorem \4.S\ if the maximum degree of G is less than m 2S , 
then, with probability 1 — (1 + o(l))m~ s , Algorithm A asks O ( m log [fg ^ log ~ ) queries to find all edges of G. 

Proof. Suppose (a)-(e) of Lemma f4.5l hold. First, we show that the parameters (2m 1//2_25 , n, a, 3/3, m~ 1 / 2+75 ) 
satisfy all the requirements in Corollary 11.31 If y is counterfeit, then Wi(y) = wa(Xi,y) ^ 0. This gives 
y € N(Xi) and hence the number of counterfeit coins is at most |iV(Xj)| < 2m l l 2 - 2S by (a) of LemmaB 
The number of all coins is \Y\ < n. If y ~ x for only one x £ Xj, then = \wG(x,y)\ > a. Thus, 

< |^i(y)| < a implies d(y;Xi) > 2. The number of such y € Y is at most m 5S = m - l / 2 + 7 $ . 2m 1 / 2 ~ 25 by 
(c) of Lemma 14.31 Since d(y;Xi) < 3 by (b) of Lemma 14.31 \wi(y)\ < J2x€Xt \ w ( x >y)\ — 3/3. Therefore, the 
algorithm finds the set Xo(Xj) of all counterfeit coins, with probability 1 — 0(m~ 2 ) for each Xj. Similarly, 
the algorithm finds the set Nq(Yj) of all counterfeit coins, with probability 1 — 0(m~ 2 ) for each Yj. Since 
there are 2m 1 / 2+2S sets X and Yj, X (X) = {y G Y : Wi(y) ^ 0} and N (Yj) = {x G X : Wj (x) / 0}), 
with probability 1 — 0(l/m). 

If e = xy is a unique edge between Xj and Yj, then \wi(y)\, \wj(x)\ > a, especially, y G iVo(Xj) and 
x G No(Yj). Moreover, as there is no other edge between Xj and Yj, iVo(Xj) fl Yj = {y} and Xj n No(Yj) = 
{x}. Thus, the algorithm finds the edge e = xy in (ii). By (e) of Lemma 14.31 at most 3m 1_3<5 edges remain 
unfound in (ii). All the remaining edges can be found in (iii) with probability 1 — e _n ( ml 3 ) by Lemma 

EU i/2 _ 2s 

For the query complexity, in (i), 0(— log ^ ~ ) ( l uer i es are enough for each Xj or Yj. As there 

are 2m 1 / 2+2S such sets, 0( mlog {f g / ^ ) log " ) queries are enough in (i). In (ii), if |X (X)nlj| = \Xi^N Q {Yj)\ = 1, 
then there is at least one edge between Xj and Yj. As there are at most m such pairs Xi,Yj, m queries 
are enough in (ii). Since o( T ^'^" ) queries are asked in (iii), the query complexity of the algorithm is 

^./ TOlogQj/qjlogn s 
logm >■ 

□ 

For general graphs, select each vertex of Y with probability m~~ s , 5 = 0.05, independently of all other 
vertices. Let Gi be the induced graph on X and the selected vertices of Y. 



17 



Lemma 4.5. If G has at most m edges, then the fallowings hold with probability 1 — 0(m s / 2 ). 

(a) The number of edges in G\ is at most m 1-<5 / 2 . 

(b) Ifd Gl (x) > m 5 /2, then d G (x) < 2m 5 d Gl (x) < 3d G (x). 

(c) Ifd G {x) > m 2S , then d Gl {x) > m 5 /2. 

Proof. As each edge in G\ with probability m _<5 , the expected number of edges in G\ is at most m l ~ 5 . 
Markov Inequality gives 



Pr[the number of edges in G\ > m 1 5 ^ 2 ] < 



in 



-6/2 



For the degree d Gl (x) of x in G\, as E[d Gl (x)] = m s d G (x), Lemma HOI with c y = 1 if y ~ x and c y = 
otherwise, A = p = 1/2 gives, for x £ X with d G (x) < 



Pr 



..i s 

\d Gl (x) - m d d G (x)\ > — 



m s e l / 2 m 5 d G (x)\ t m 5 



<2exp( - — + a \ <2exp( - — ). 



In particular, if d G (x) < then d Gl (x) — m~ s d G (x) < m s /4, or equivalently, d Gl (x) < m s /4 + 

m~ 5 d G (x) < m 5 /2, with probability 1 - e "°( m ^ . 

For (b), it is now enough to show that d G (x) < 2m s d Gl (x) < 3d G (x) when d G (x) > ^j-, say, with 

probability 1 — e ~^ m5 \ Lemma |2~41 with c y = 1 if y ~ x and c y = otherwise, A = — — p — 1/3 a l so 
gives 



Pr 



d Gl (a;)-m d G (x)j > j < 2exp(^ + — ) < 2exp(^ — ) 



for e 1 / 3 < 3/2. If d G {x) > m 2<5 /4, we have 1 2m s d Gl (x) — 2d G (x)\ < d G (x), or equivalently, d G (x) < 
2m s d Gl (x) < 3d G (x), with probability 1 — e~ n ( m Moreover, if d G (x) > m 2S , then 2m s d Gl (x) > d G (x) > 
m 2S . That is, d Gl (x) > m s /2, which shows (c). 



□ 



Algorithm B (i) Apply the randomized binary search to find edges of G\ one by one, using (2|~logn] + 
5)m 1_,5//2 queries. Let G2 be the graph on X U Y consisting of all edges found. 

(ii) For each vertex x £ X with d G2 (x) > m <5 /2, regard each y £ Y as a coin with weight w x (y) := 
w G (x,y) and apply the coin weighing algorithm in Corollary 11.31 with parameters (m,n,a, f3,e, p) replaced 
by (2m s d G2 (x), n, a, ft, 0, 1/5). The vertices x £ X with d G2 (x) > m s /2 are called vertices of large degree. 

(iii) Output vertices of large degree and all edges found. 

Algorithm B has the following property. 

Lemma 4.6. Under the same hypotheses as in Theorem \4-S\ with probability 1 — 0(m~ s / 2 ), Algorithm B 
uses O ( m log (fg ^] " ) Q uer ">' es t° find a ^ vertices x £ X with d G (x) > m 2S and all edges containing them. 

Proof. Suppose (a) and (b) of Lemma 14.51 hold. Then Lemma 14.11 yields G% = G\ with probability 1 — 
gf^m 1 s l 2 ) _ "Yy-g assume that G\ = G2 in the rest of the proof. 

In (ii), note that the number of counterfeit coins for x is d G (x), which is at most 2m s d G2 (x) for all 
d Gl (x) = d G2 (x) > m s /2 by (b) of Lemma 14.51 Thus, the algorithm in Corollary 11.31 finds N G (x) for 
each x £ X satisfying d G2 (x) > m 5 /2, with probability 1 - 0(l/(2m 5 d G2 (x)) 1 / 5 ) = 1 - 0(l/m 2 ). As 
do( x ) > d G2 (x), there are at most 2m 1 ~ s vertices x £ X with d G2 (x) > m s /2 and the algorithm finds 
N G (x) for all such vertices x £ X, with probability 1 — 0(l/m). In particular, if d G (x) > m 2S , then 
dG 2 ( x ) > m s /2 by (c) of Lemma 1431 and hence N G (x) are found. 
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For the query complexity, (2[logn] +5)m 1 s / 2 queries are asked in (i). In (ii), 0(— — ° 2 * \°g m " ° g " ) 
queries are asked for each x € X with d G . 2 {x) > m s /2. On the other hand, d G2 (x) > m s /2 implies 
2m 5 d G2 (x) < 3d G (x) by (b) of Lemma EDS Thus, 

x:da 2 {x)>m 5 /2 x&X 

gives that O ( m log (fg °^ log n ) queries are asked in (ii). 

□ 

To find all vertices v in G with d G (v) > m 2S , one may apply Algorithm B twice, one as it is and the 
other after exchanging roles of X and Y. Then, after removing all vertices found (and all edges containing 
any of them), we apply Algorithm A. Lemmas 14.41 and 14.61 imply that 

Corollary 4.7. Under the same hypotheses as in Theorem \4-2\ there is a polynomial time randomized 
algorithm asking 0{ m log [fg ^ log " ) queries to find all edges of G, with probability 1 — O(l/m ' 02 ). 

If the algorithm in the corollary is forced to stop when it asks r ? m Igg^LS lo s n queries, for the constant 
r/ in the O ( m log ^ log n ) term, the desired algorithm in Theorem 14.21 may be obtained. 



5 Concluding Remarks 

In this paper, we presented a polynomial time randomized algorithm that uses O ( m log (og ^ lQg ~ ) queries, 
when there are at most m counterfeit coins and the weights w(c) of all counterfeit coins satisfy a < \w(c)\ < 
(3. This plays a key role to find a hidden weighted graph G satisfying similar conditions. Though there is 
a non-adaptive algorithm to find all counterfeit coins using 0( ^°f^ ) queries [11] . it is not a polynomial 
time algorithm. An obvious question is if there is a polynomial time algorithm to find all counterfeit coins 
using O C^og'm" ) Q uer i es when there is no restriction on the wights. 

The algorithm we presented was a randomized algorithm that uses the optimal number of queries up 
to a constant factor. On the other hand, the best deterministic algorithm uses © C^ogm" +w-loglogm) (see 
[9]), it would be good to implement a deterministic polynomial time algorithm that uses 0( 7 gm" ) queries 
even when the weights of counterfeit coins are positive real numbers. 
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Appendix: Proofs of Lemmas 13. 1L 13.21 and 14.1 



In this appendix, we prove Lemmas 13.11 13.21 and 14. li 

Lemma 13.11 The randomized binary search finds all counterfeit coins with probability 1 — e~^ m ) . 

Proof. If there is a counterfeit coin c, conditioned on A" = A'\ {c}, A' can be one of A" and A" U {c}, each 
with probability 1/2. Since w(A" U {c}) = w(A") + w(c) ^ w(A"), the probability of w(A') / is at least 
1/2- 

Let Zi be the number of random trials when the i th counterfeit coin is found. Then, for a = ln(4/3), 
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[e ^...A-lj-^e (1 PJ Pi- 1 _ pil _ {1 _ pi)ea - 1 _ {1 _ pi)e a< 2 _ & a' 



(1 -Pi)e a 



p,e 



for p. := Pt[w(A') ^ 0\Z U ..., Z^ x ] > 1/2. As 
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for all I = 1, m, we have 

m* 

P*[J2 Zi - 3m l - Pr[e aE ^* Zi > e 3am ] < E[e a{ ^ z '- 3m) ] < 



-2a 



2 - e a 
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where m* < m is the number of counterfeit coins, 



□ 



Lemma 13.21 Suppose a set A of n or less coins are given, and the number of counterfeit coins in A is at 
most q > 2. If the weights w(c) of all but at most q/2 counterfeit coins c satisfy \w(c)\ > a. Then, with 
probability 1 — 0(|), we have the followings. 

(a) There are at most counterfeit coins c that satisfy \w(c)\ < a (not exclusively) or belong to a set Aqj 
containing more than one counterfeit coin, j = l,...,2* s . 

(b) For each i = 1, [~21og(?] — 1, Aij contains at most ^ +2 | og9 counterfeit coins. 

(c) For each i = 1, [~21ogg] — 1, there are at most 2~^ +l \ + q 3 ^ sets Aij that contain more than one 
counterfeit coin. 

(d) Fori > [2 log 9] — 1, each Aij contains one or less counterfeit coin. 

(e) Each ^4[3i og n],j contains at most one coin. 
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Proof, (a) For any counterfeit coin c, the probability that c belongs to a set Aqj containing another 
counterfeit coin is at most 1 — (1 — 2~ iq ) q ~ 1 < 1 — 1/e. Thus, the number of such counterfeit coins c with 
|w(c)| > a is at most (1 — l/e)(q — in expectation, where q t is the number of counterfeit coins c with 
|io(c) | < a. As q — q 1 > q/2 and the number depends only on where counterfeit coins are in, we may apply 
the Azuma-Hoeffding martingale inequality (Lemma 12 .3|) with c e = 2 and c 2 < 4g to deduce that the 
number of such counterfeit coins c with |tu(c)| > a is at most 2(q — q^/3, with probability 1 — e - ^^. Thus, 
with probability 1 — e~^ q \ there are at most 

2(9 " gJ/3 + g a = 2g/3 + 9l /3 < 2g/3 + q/6 = 5q/6 

counterfeit coins c that satisfy |itf(c)| < a or belong to a set Aqj containing another counterfeit coin. 

(b) For each set Aij, the probability that Aij contains k i := [ t+2 ' og<? ] or more counterfeit coins are bounded 
from above by 

/ g\ 2 -fe.(^+i) < 2 _ fc .(^ +i )^. 

Thus, for each i = 1,..., [21ogg] — 1, the probability that Aij contains k i or more counterfeit coins is at 
most 

q' 

(c) The probability that Aij contains two or more counterfeit coins is at most 

q^ 2 -2(£ q +i) < 2 - 2 (^+i)-l g 2 ) 

and, for each i, the expected number of Aij containing two or more counterfeit coins is at most 

2 e q +i 2 -2(e q +i)-i q 2 < 2 -(e q +i)-i q 2 = 2 -(i+i) g 

Counting coordinates corresponding to counterfeit coins only, we apply the Azuma-Hoeffding martingale 
inequality (Lemma 12. 3p with c e = 1 to conclude that, for each i, the number of A% containing two or more 
counterfeit coins is at most 2~(' +1 )q + g 3 / 4 , with probability 1 — e~^ gl ^ 2 \ 

(d) For i = [21ogg] — 1, by the same estimation as in (c), the probability that Aij contains two or more 
counterfeit coins for some j is at most 2~^ + ^q < 1/q. If each ^4[2iogg]-ij' contains at most one counterfeit 
coin, then so does each Aij with i > [2 log q~\ , for every set Aij with i > [2 log q~\ is a subset of some 

A\2 log q]-l,£- 

(e) The statement follows since each set Aij with i > [2 log q\ — 1 is deterministically divide into two sets 
with size difference at most 1. 

□ 

Lemma 14.11 The randomized binary search finds all edges of G with probability 1 — e~^( m ) . 

Proof. If G has at least one edge, say e = xy, conditioned X" := X'\{x} and Y" = Y'\{y}, {X 1 , Y') can be 
one of {X", Y"), {X" U {x}, Y"), (X" , Y" U {y}), and {X" U {x}, F" U {y}), each with probability 1/4. If all 
three weights w(X", Y"), w(X" U {x}, Y"), w(X", Y" U {y}) are 0, then w(X" U {x}, Y" U {y}) = w(e) ^ 0. 
In other words, at least one of the four weights is non-zero. This yields that the probability of w(X' , Y') ^ 
is at least 1/4. 

Let Zi be the number of random trials when the i th edge is found. Then, for a = ln(13/12), 
E[e aZ i\Z u ...,Zi„ x ] =Ye ak (l-p.) k - 1 p. = [LzIilL = ^ e < 



A _ 1-p, l-{l- Pi )e a l-(l-p x )e a ~ 4-3e a 
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for p. := Pv[w{X', Y') + 0\Z U .., Z^\ > 1/4. Thus, 

m * . -An 

Pr£ > 5m] < Pr[e a ^-i z * > e 5am ] < E[e a( -^ z *~ 5m )] < ( 4 _ 3 ) < (0.97) 



m — fi(ra) 



i=i 



where m* < m is the number of edges in G, as 
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for all £ = 1, ...,m*. 



□ 
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